Incorporating Concept Hierarchies into Usage Mining
نویسندگان
چکیده
Web usage mining is being used extensively for Web Personalization. Many algorithms and techniques have been proposed to predict the next user request. Most, however, are limited in terms of their ability to use concept hierarchy and connectivity of the website. Recent studies have shown that conceptual and structural characteristics of the website play an important role in the quality of the recommendation models. In this paper we propose a new technique to incorporate conceptual characteristics of a website into the recommendation models, and use sequence alignment techniques, adapted from the field of bioinformatics, coupled with a new model for defining page similarity. We introduce a scoring methodology to quantify page similarity derived from the concept hierarchy of a website. These scores are an essential ingredient in the sequence alignment technique. Other aspects, like time spent by the user on a page and page access sequence are also considered during the alignment. Thus, the system that we propose makes use of various sources of information to make recommendations. Finally we present experimental results to show the effectiveness of our method.
منابع مشابه
Web Usage Mining in Noisy and Ambiguous Environments: Exploring the Role of Concept Hierarchies, Compression, and Robust User Profiles
Recent efforts in Web usage mining have started incorporating more semantics into the data in order to obtain a representation deeper than shallow clicks. In this paper, we review these approaches, and examine the incorporation of simple cues from a website hierarchy in order to relate clickstream events that would otherwise seem unrelated, and thus perform URL compression. We study their effec...
متن کاملBuilding and Exploiting Ad Hoc Concept Hierarchies for Web Log Analysis
Web usage mining aims at the discovery of interesting usage patterns from Web server log files. “Interestingness” relates to the business goals of the site owner. However, business goals refer to business objects rather than the page hits and script invocations recorded by the site server. Hence, Web usage analysis requires a preparatory mechanism that incorporates the business goals, the conce...
متن کاملIncorporating Concept Hierarchies into Usage Mining Based Recommendations
Recent studies have shown that conceptual and structural characteristics of a website can play an important role in the quality of recommendations provided by a recommendation system. Resources like Google Directory, Yahoo! Directory and web-content management systems attempt to organize content conceptually. Most recommendation models are limited in their ability to use this domain knowledge. ...
متن کاملDiscovering Fuzzy Unexpected Sequences with Concept Hierarchies
Sequential pattern mining is the method that has received much attention in sequence data mining research and applications, however, a drawback is that it does not profit from prior knowledge of domains. In our previous work, we proposed a belief-driven method with fuzzy set theory for discovering the unexpected sequences that contradict existing knowledge of data, including occurrence constrai...
متن کاملMulti-level Association Rule Mining: an Object-oriented Approach Based on Dynamic Hierarchies
Previous studies in data mining have yielded e cient algorithms for discovering association rules. But it is well-known problem that the two controlling measures of support and con dence, when used as the sole de nition of relevant association rules, are too inclusive | interesting rules are included with many uninteresting cases. A typical approach to this problem is to augment the thresholds ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006